14:03
2026-06-19
runtimewire.com
large-language-models
Head to head: grok-4.3 vs gpt-oss-120b
In a head-to-head comparison of four text tasks, xAI's grok-4.3 scored 37.8 against OpenAI's gpt-oss-120b at 34.4, winning on precision in fact-based summarization. The models tied on two coding tasksβ¦